Learning Token-Based Representation for Image Retrieval

Abstract

In image retrieval, deep local features learned in a data-driven manner have been shown to improve retrieval performance. To achieve efficient retrieval on large image databases, some approaches quantize deep local features with a codebook and match images with an aggregated match kernel. However, the complexity of these approaches is non-trivial, with a large memory footprint, which limits their capability to jointly perform feature learning and aggregation. To generate compact global representations while maintaining regional matching capability, we propose a unified framework to jointly learn local feature representation and aggregation. In our framework, we first extract deep local features using CNNs. Then, we design a tokenizer module to aggregate them into a few visual tokens, each corresponding to a specific visual pattern. This helps to remove background noise and capture more discriminative regions in the image. Next, a refinement block is introduced to enhance the visual tokens with self-attention and cross-attention. Finally, the different visual tokens are concatenated to generate a compact global representation. The whole framework is trained end-to-end with image-level labels. Extensive experiments are conducted to evaluate our approach, which outperforms the state-of-the-art methods on the Revisited Oxford and Paris datasets.
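The aggregation step described in the abstract (local features pooled into a few visual tokens, which are then concatenated into one global descriptor) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the local features are random vectors standing in for CNN outputs, the tokenizer is modelled as softmax attention from a small set of query vectors (random here, learned in a real system), the refinement block is omitted, and the names `tokenize`, `global_descriptor`, `N`, `DIM`, and `NUM_TOKENS` are hypothetical.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def tokenize(local_feats, queries):
    """Aggregate N local features into len(queries) visual tokens.

    Assumed mechanism: each query scores every local feature, a softmax
    over the N locations turns the scores into weights, and the token is
    the weighted average of the local features.
    """
    dim = len(local_feats[0])
    scale = 1.0 / math.sqrt(dim)
    tokens = []
    for q in queries:
        weights = softmax([dot(q, f) * scale for f in local_feats])
        token = [sum(w * f[d] for w, f in zip(weights, local_feats))
                 for d in range(dim)]
        tokens.append(token)
    return tokens


def global_descriptor(tokens):
    """Concatenate the tokens and L2-normalise the result."""
    flat = [v for t in tokens for v in t]
    norm = math.sqrt(sum(v * v for v in flat)) or 1.0
    return [v / norm for v in flat]


random.seed(0)
N, DIM, NUM_TOKENS = 49, 8, 4  # illustrative sizes, e.g. a 7x7 feature map
# Random stand-ins for CNN local features and for learned token queries.
local_feats = [[random.gauss(0, 1) for _ in range(DIM)] for _ in range(N)]
queries = [[random.gauss(0, 1) for _ in range(DIM)] for _ in range(NUM_TOKENS)]

tokens = tokenize(local_feats, queries)
descriptor = global_descriptor(tokens)
print(len(tokens), len(descriptor))
```

Because the softmax weights concentrate on the locations that best match each query, a token of this kind can down-weight background regions. The descriptor length is the number of tokens times the feature dimension, which is what keeps the global representation compact.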

Similar resources

Supervised Hashing for Image Retrieval via Image Representation Learning

Hashing is a popular approximate nearest neighbor search approach for large-scale image retrieval. Supervised hashing, which incorporates similarity/dissimilarity information on entity pairs to improve the quality of hashing function learning, has recently received increasing attention. However, in the existing supervised hashing methods for images, an input image is usually encoded by a vector...

A Radon-based Convolutional Neural Network for Medical Image Retrieval

Image classification and retrieval systems have gained more attention because of easier access to high-tech medical imaging. However, the lack of large-scale, balanced, labelled data in medicine is still a challenge. Simplicity, practicality, efficiency, and effectiveness are the main targets in the medical domain. To achieve these goals, Radon transformation, which is a well-known t...

Hypergraph-based image retrieval for graph-based representation

In this paper, we introduce a novel method for graph indexing. We propose a hypergraph-based model for graph data sets by allowing cluster overlapping. More precisely, in this representation one graph can be assigned to more than one cluster. Using the concept of the graph median and a given threshold, the proposed algorithm detects automatically the number of classes in the graph database. We ...

Shape Based Image Representation and Retrieval

Among all the issues related to Content Based Image Retrieval systems, retrieving images based on their shapes is an important one. Many approaches exist utilizing shape representation and comparison, e.g., the methods based on Fourier descriptors. In this thesis, we propose a novel method for shape representation. In our method, we calculate the centroid of a shape and choose a set of sample po...

Content Based Image Retrieval Using Signature Representation

Retrieving relevant images from a large, diversified collection using visual queries (image content) as search argument is a challenging and important open problem. It requires an efficient and effective content-based image retrieval (CBIR) system. Image representation has a profound effect on the performance of CBIR. This paper presents a CBIR system based on a novel image representation using...

Journal

Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence

Year: 2022

ISSN: 2159-5399, 2374-3468

DOI: https://doi.org/10.1609/aaai.v36i3.20173